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Abstract. In this paper we derive the generating function of RNA structures with pseudoknots. 
We enumerate all fc-noncrossing RNA pseudoknot structures categorized by their maximal sets of 
mutually intersecting arcs. In addition we enumerate pseudoknot structures over circular RNA. 
For 3-noncrossing RNA structures and RNA secondary structures we present a novel 4-term 
recursion formula and a 2-term recursion, respectively. Furthermore we enumerate for arbitrary 
k all fe-noncrossing, restricted RNA structures i.e. fc-noncrossing RNA structures without 2-arcs 
i.e. arcs of the form (i, i + 2), for 1 < i < n — 2. 



1. Introduction 



In this paper we study the combinatorics of helical structures of RNA sequences. RNA is described 
by its primary sequence of nucleotides A, G, U and C together with the Watson-Crick (A-U, 
G-C) and (U-G) base pairing rules specifying which pairs of nucleotides can potentially form 
bonds. Subject to these single stranded RNA form helical structures. The function of many RNA 
sequences depends on their structures. Therefore it is important to understand RNA structure in 
the context of studying the function of biological RNA as well as in the design process of artificial 
RNA structures. Since RNA is capable of catalytic activity, for instance RNA ribozymes can cleave 
other RNA molecules, it is believed that RNA may have been instrumental for early evolution, 
before Proteins emerged. A particularly well-studied sub-class of RNA structures, consisting of 
planar graphs arc the RNA secondary structures. Their combinatorics was pioneered by Waterman 
et. al. in a series of seminal papers [HI [17l [16j 123 H2] • RNA secondary structures are coarse grained 
structures and systematic prediction of the full three dimensional structures, the tertiary structures 
seems at present time to be out of reach. It was shown in [18] that the prediction of secondary 
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structures can be obtained in polynomial time and their combinatorics, specifically the existence of 
recursion relations is the key for all folding algorithms [T5l [9] . Over the last two decades a variety 
of prediction algorithms, based on minimum free energy [151 1181 125|. kinetic folding [14j or the 
partition function |T3] for RNA secondary structures has been derived. 

An increasing number of experimental findings, as well as results from comparative sequence anal- 
ysis imply that there exist additional types of interactions between RNA nucleotides [7]. These 
bonds are called pseudoknots and occur in functional RNA like for instance RNAseP [3] as well 
as ribosomal RNA [5J. RNA pseudoknots are conserved also in the catalytic core of group I in- 
trons. In plant viral RNAs pseudoknots mimic tRNA structure and in in vitro RNA evolution 
[4] experiments have produced families of RNA structures with pseudoknot motifs, when binding 
HIV-1 reverse transcriptase. In addition important mechanisms like ribosomal frame shifting [5] 
also involve pseudoknot interactions. As a result RNA pseudoknot structures have drawn over the 
last years a lot of attention [1] . Several folding algorithms [22J [26j [24j [21] have been developed 
which include certain families of pseudoknots. The prediction problem in general is (although we 
have not seen formal proof) believed to be NP-hard. In difference to RNA secondary structures a 
recursive enumeration for pseudoknot RNA is believed to be non-trivial but nevertheless of vital 
importance for prediction algorithms. Intuitively if bonds can cross it is much harder to enumerate 
since structural elements can now interact and as a result a structure cannot be straighforwardly 
decomposed into independent sub-structures. Little is known with respect to the combinatorics of 
pseudoknot RNA structures. Stadler et al. [8j suggested a classification of their knot-types based 
on a notion of inconsistency graphs and provided an upper bound for a certain class of pseudoknots 
(our 3-noncrossing RNA structures). 

In this paper we introduce a novel approach for the enumeration of RNA structures. Based on 
new concepts in enumerative combinatorics |28l 1 10] we use a method which has the potential to 
offer insight also into other lattice structure concepts. To be precise Chen et.al. have shown in [28] 
that there is a bijection between certain types of matchings and walks inside Weyl-chambcrs. This 
bijection is obtained via his construction of oscillating tableaux i.e. families of Young diagrams 
in which any two consecutive shapes differ by exactly one square. The corresponding walks can 
then be enumerated via determinant formulas derived from a reflection principle due to Gessel 
and Zeilberger |10j and Lindstrom [3]. The key idea behind the reflection principle is that walks 
which hit the wall of a Wcyl-chamber can be reflected. The original (unreflected) and the reflected 
walk cancel themselves leaving just the walks that never hit a wall. Crucial for its applicability 
are restrictive symmetry assumptions since the reflected walk has to be of the same type and, 
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more importantly, the reflection itself can occur at any step. These symmetries are non-existent 
in walks corresponding to RNA structures. However, our derivation of the generating function of 
RNA structures is based on these symmetric walks. The key idea is to introduce the asymmetries 
of RNA structures into the symmetric walks using a certain involution idea. We believe that our 
particular strategy can be applied for the enumeration of further structure classes. As a result we 
have tried to keep this paper self contained. 

Our main result is the enumeration of all RNA structures. We classify RNA structures by their 
specific crossing types under the assumption that all base pairs can occur. For arbitrary but fixed 
k we enumerate all RNA structures with no fc-set of mutually intersecting bonds. In case of k = 2, 
our results reduce to noncrossing structures, i.e. RNA secondary structures [27] and Waterman's 
formula for the number of RNA secondary structures with exactly k bonds is derived directly. 
The case k = 3 coincides with Stadlcr's bi-sccondary structure [S]. We obtain from the generating 
function a novel 4-term recursion formula for RNA structures of length n with no 3-set of mutually 
intersecting bonds and having t isolated vertices. We believe that this recursion is the key for 
developing new prediction algorithms for RNA structures. Also we derive the generating function 
for circular RNA structures i.e. for sequences where the bond between 1 and n is considered part 
of the primary sequence. Finally we enumerate restricted RNA structures, i.e. structures in which 
two interacting nucleotides have at least distance 3. 

The paper is structured as follows. We will begin by introducing several important combinatorial 
concepts needed for our derivations. Young tableaux, oscillating Young diagrams, RSK algorithm, 
Wcyl-chambers and the reflection principle. We discuss these concepts, illustrate all key ideas and 
give the corresponding proofs in the Appendix. Our derivation is obtained in three steps. First 
(Theorem [TJ wc show that each structure (represented as a fc-noncrossing digraph) corresponds 
uniquely to a walk starting and ending at (fc — 1, k — 2, . . . , 1) in Z fe_1 and which never touches a 
wall of the Weyl-chamber Co- Secondly we apply the reflection principle (Theorem [2]) in order to 
count the symmetric walks that remain in the interior of Cq. Thirdly (Theorem [3]) we incorporate 
the specific properties of RNA into these symmetric walks and compute the generating function 
of fc-noncrossing RNA structures. We show how our results relate to known formulas of RNA 
secondary structures for which wc present a two new term recursion formula. For 3-noncrossing 
RNA structures we give a novel 4-term recursion formula. We finally generalize our strategy 
(Theorem [5]) and enumerate restricted RNA structures. 
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2. From structures to walks and back 

Let us begin by illustrating the concept of RNA structures. Suppose we are given the primary 
sequence 

AACCAUGUGGUACUUGAUGGCGAC . 

Structures are combinatorial graphs over the labels of the nucleotides of the primary sequence. 
These graphs can be represented in several ways. In Figure 1 wc represent a particular structure 
with loop-loop interactions in two ways: first we display the structure as a planar graph and 
secondly as a diagram, where the bonds are drawn as arcs in the positive half-plane. 




FIGURE 1. Two representations of RNA structures, planar graphs (top) and diagrams (bottom) 

In the following we will consider structures as diagram representations of digraphs. A digraph D n 
is a pair of sets Vd„, Eo n , where Vd„ = {1, • • ■ ,n} and Eo n C | 1 < i < j < n}. Vd„ and 

Ejj n are called vertex and arc set, respectively A fc-noncrossing digraph, Gk. n , is a digraph in 
which all vertices have degree < 1 and which does not contain a fc-sct of arcs that are mutually 
intersecting, i.e. 

(2.1) (i ri , jVi); (*r 2 i jVa)) ■ ■ ■ i (Vfc j jr k ) 5 in < Va < ' ' ' < h- k < jri *^ Jra < ' ' ' < jr k ■ 

The set of all fc-noncrossing digraphs Gk, n is denoted by Sn,k and we set G n ,k = |Sn,fe|- The (formal) 
direction of the edges will have procedural convenience when we map a fc-noncrossing digraph into 
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an oscillating tableaux (Theorem [T]) . We will represent digraphs as a diagrams (Figure [T]) by 
representing the vertices as integers on a line and connecting any two adjacent vertices by an arc 
in the upper-half plane. The direction of the arcs is implicit in the linear ordering of the vertices 
and accordingly omitted. 

Definition 1. An RNA structure (of pseudoknot type k — 2), Sk. n , is a digraph in which all 
vertices have degree < 1, that does not contain a k-set of mutually intersecting arcs and 1-arcs, 
i.e. arcs of the form (i, i + 1), respectively. We denote the number of RNA structures by Sfe(n) and 
the number of RNA structures with exactly £ isolated vertices by Sk(n, £), respectively. We call an 
RNA structure restricted iff it does not contain any 2-arcs, i.e. an arc of the form + 2). 



FIGURE 2. RNA structures represented as diagrams, i.e. arcs over {l,...,n} in the 
upper half-plane, fc-noncrossing digraphs are precisely those which have no fc-set of 
mutually intersecting arcs. 2-noncrossing diagrams without 1-arcs (top) correspond to 
secondary structures. 3-noncrossing diagrams without 1-arcs (bottom) represent RNA 
structures with pseudoknots. 



We derive the enumeration of RNA structures in three steps. First we establish a bijection from 
fc-noncrossing digraphs into a certain class of walks. Secondly we will use the reflection principle in 
order to count these walks. Thirdly we enumerate all walks subject to specific conditions recruiting 
a certan involution idea. Let us first discuss two basic concepts needed for our arguments. 

Young tableaux and the RSK algorithm. A Young diagram (shape) is a collection of squares 
arranged in left-justified rows with weakly decreasing number of boxes in each row. A Young 
tableau is a filling of the squares by numbers which is weakly decreasing in each row and strictly 
decreasing in each column. A tableau is called standard if each entry occurs exactly once. An 
oscillating tableau is a sequence = (jr,(i,...,(i n = of standard Young diagrams, such that 




1 2 3 4 5 B 7 8 ) TO 11 12 13 




1 2 3 4 S 6 7 8 9 f€ 11 12 13 
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for 1 < i < n, fji 1 is obtained from /Lt 8 1 by either adding one square or removing one square. For 
instance the sequence 



Figure 3. 1 



is an oscillating tableaux. 

The RSK algorithm is a process of row-inserting elements into a tableau. Suppose we want to 
insert k into a standard Young tableau A. Let Xi.j denote the element in the «th row and jth 
column of the Young tableau. Let i be the largest integer such that Ai^-i < k. (If Ai,i > fc, then 
i = 1.) If Ai,, does not exist, then simply add k at the end of the first row. Otherwise, if Ai j 
exists, then replace Ai,; by k. Next insert Ai^ into the second row following the above procedure 
and continue until an element is inserted at the end of a row. As a result we obtain a new standard 
Young tableau with k included. For instance inserting the number sequence 5, 2, 4, 1, 6, 3 starting 
with an empty shape yields the following sequence of standard Young tableaux: 



wim an empty snapc yieias me ioiidwuie sequence or siaiiaara laung r-aoiesui: 
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Figure 4. 

Symmetry groups and Weyl- chambers. We consider the lattice Z fc_1 and walks in Z k ~ 1 
having the steps s contained in {±ei,0 | 1 < i < k — 1}, where denotes the ith unit vector. 
That is for a,b G Z fe_1 a walk from a to b, j a .b, of length n is an n tuple (s±, . . . ,s„) where 
Si G {ie^, | 1 < i < k — 1} such that b = a + J2)l=i s h- We set -f a ,b{s r ) = a + Y^h=i Sfl e 
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i.e. the element at which the walk (s±, . . . , s r ) resides at step r. The symmetric group S k -i acts 
on Z fe_1 via a{x i ) 1 <i< k ^ 1 = (av-i(j))i<i<fc_i. We set Ek-i = (e« | 1 < i < k - 1), where 
ei(xi, . . . ,X{, . . . ,Xk-i) = (xi, . . . , —Xi, . . . ,Xfc-i). As shown in the Appendix {e;er | a 6 Sk-i, £j G 
£fc_i} carries a natural group structure via (eia)-(eja') = ei0~€ja~ l oo' = eita-i^aa' . This group, 
denoted by Bk_i, is generated by Mk-i = {ei,Pj | 2 < j < k — 1}, where pj = — 1), i.e. p 
transposes the coordinates Xj-\ and Xj. By definition B^-i acts on the set 

(2.2) Afe_i = {±e t | 1 < i < k - 1} U {e l ±e j | 1 < i,j < k - 1} 

and we call Afe_i the set of roots. We observe that there exists a bijection between A' k _ 1 = 
{ei, ej — | 2 < j < fc — 1} and the set of generators Mk-i which maps each root a G A'j,^ 
into a corresponding reflection (in particular: & k -i is generated by reflections) 

(2.3) {e^ej-ej-i \ 2 < j < k - 1} — > {e 1; Pj \2<j<k-l}, ( f3 a = x >-> x - 2 ^ 



(a, a) 

where (x,x') denotes the standard scalar product in Z fc_1 . It is clear that AJ.^ is a basis of Z fc_1 . 
We refer to the sub spaces (e.;) for 1 < i < fc — 1 and (e^ — e^-i) for 2 < j < k — 1 as walls. A 
Bfc_i-chamber is defined as the set of x G Z fe_1 with the property that (a, x) > for all a G Afe_i. 
We denote the Weyl chamber 

(2.4) C = {x G Z fe_1 | < < x k _2 <■■■ <xi} . 

For RNA secondary structures we have k — 1 = 1, and Bi = E\ = {ei,l} and A' = {ei}. For 
3-noncrossing RNA we have k — 1 = 2 and B2 = E2 x v ^2 — D4 (where ip : S2 — > Aut(i?2)) is the 
dihedral group of order 8. 



The following theorem is the first step for the enumeration of RNA structures. It will allow to 
interpret a certain class of digraphs as walks in Z k ~ 1 which remain in the interior of the Weyl 
chamber Co- The result is due to Chen et al [55], where it is formulated for matchings. The 
original bijection between oscillating tableaux and matchings is due to Stanley and was generalized 
Sundaram [23] . We give a proof of Theorem [T] in the Appendix. 

Theorem 1. |28j There exists a bijection between k-noncrossing digraphs and walks of length n in 
Z fc_1 which start and end at a = (k — 1, fc — 2, . . . , 1) having steps 0, ie^, 1 < i < k — 1 such that 
< Xk-i < ■ ■ • < Xi at any step. I.e. we have a bijection 

(2.5) Sn.fe — ► {la, a \ la, a remains inside the Weyl- chamber Co} , 

where 5 n , k denotes the set of k-noncrossing digraphs of length n. 
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10 11 12 13 14 15 16 17 



FIGURE 5. A 5-noncrossing digraph. In the text we show how to derive from this 
digraph an oscillating tableau and subsequently the corresponding walk y a , a in Z 4 . 



a \m} 
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Figure 6. 1 



The 5-noncrossing digraph in Figure [S] corresponds to a oscillating tableau as follows: from right 
to left start at vertex 17, which is a terminus. The corresonding origin is 7 which is inserted via 
the RSK algorithm into the empty shape. Next insert the origin corresponding to 16 and 15, 
respectively. At 14 and 13 nothing happens since they are isolated vertices. At 12 origin 3 is 
inserted and 11 is isolated. 10 is an origin of an arc and accordingly removed from the tableaux. It 
is now clear how to proceed. The walk 7 a a is obtained from the tableau as follows: its Xi-coordinatc 
is the number of squares in the i-th column, i.e. 7 QiQ is given by 

(4, 3, 2, 1), (5, 3, 2, 1), (6, 3, 2, 1), (6, 4, 2, 1), (6, 4, 2, 1), (6, 4, 2, 1), (6, 4, 3, 1), (6, 4, 3, 1), (5, 4, 3, 1), 
(5, 4, 3, 2), (6, 4, 3, 2), (6, 4, 3, 1), (6, 4, 3, 1), (6, 4, 2, 1), (6, 4, 2, 1), (6, 3, 2, 1), (5, 3, 2, 1), (4, 3, 2, 1) . 

We show in the appendix in detail why this is a bijection. 

We next discuss the reflection principle. The key idea is to count walks that remain in the interior 
of a Wcyl chamber by counting all walks. Then one utilizes the fact that all walks that touch 
a wall at some step can be paired and eventually cancel themselves in the enumeration. The 
particular way to obtain this pairing is by reflecting the walk at the corresponding wall. The 
following observation is essential for the reflection principle, formulated in Theorem [2] below. 
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Lemma 1. Let A' k _ 1 = {ei,ej — e.j-i | 2 < j < k — 1}. TTien every walk starting at some lattice 
point in the interior of C having steps ±ej,0 that crosses from inside C into outside C touches a 
subspace (a — e^-i | 2 < i < k — 1) or (e.; | 1 < i < k — 1). 

Let T n (a,b) be the number of walks ^ a ,b- For a, b £ Co (eq. (|2.4p ) let T+(a,6) denote the number 
of walks j a ,b that never touch a wall, i.e. remain in the interior of Co- Finally for a,b £ Z fc_1 , 
let r~(a, 6) denote the number of walks 7a : b = (si, . . . , s„) that hit a wall at some step s r . t{(3) 
denotes the length of f3 € Bfc_i. For a = b = (k — 1, . . . , 1) we have according to ThcoremQ] 

(2.6) r+(a, a) = G„ jfe , 
where G„,fc = |9„,fc|. 

Theorem 2. (Reflection-Principle) 10] Suppose a, 6 £ Co, i/ien we /iaue 

(2.7) r+(a,6)= X! (-l^Fn^a)^) . 

PeB k _ 1 

Theorem [5] allows us to compute the exponential generating function for T^(a,b), which is the 
number of walks from a to b, that remain in the interior of Co [11] . 

Lemma 2. Let L r {2x) = 5Zj>o x +j /j-( r + j)! 6e </ie hyperbolic Bessel function of the first 
kind of order r. Then the generating functions for the numbers of k-noncrossing digraphs of length 
n and for k-noncrossing digraphs of length n without isolated points, T+(a, b) and T'^(a,b) are 
given by 

(2-8) E r «( a ' 6 )^[ = ^det[L ai _ b] (2x)~L at+bi (2x)]\1jl 1 

n>0 

(2-9) T, T 'n( a ^)— = det[I ai . bj (2x)-I ai+b] (2x)}\^ 1 . 

n>0 

Now we can combine our results in order to enumerate fc-noncrossing digraphs using the bijection 
between digraphs and walks. Theorem [T] implies that the number of fc-noncrossing digraphs is 
equal to 1^ (a, a), the number of lattice walks in Z fc_1 of length n that remain in the interior of Co 
from {k — 1, ... 1) to itself with steps 0, ±e,, 1 < i < k — 1. According to Lemma [2] the generating 
functions for walks with steps = ±1,0 and e.; = ±1, where 1 < i < k — 1 are given by 

(2.10) e x detfli-j- (2x) - (2x)]\^l l and det^- (2s) - I i+j (2x)]\^l 1 , 
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respectively. Let f k (n, £) denote the number of fc-noncrossing digraphs with £ isolated points. Then 
(2.H) fk(n,t)= rjf k (n-£,0) 

(2.12) det[/i_ J -(2x) - l l+3 (2x)}\^ 1 = ]T f k (n,0) ■ — 

n>l 

(2.13) e-defcft-^) - Ii+^x)}^ = (£, |-)(£ ^ = £ £/*M) \' • 

£>0 ' n>l " ti>1 U=0 J n ' 

In particular we obtain for k = 2 and fc = 3 



(2.14) 



/a(n,*) 



C(n-£)/2 and h{n,£) = 



C n-t | C n- 
— t — o 



where C m denotes the mth Catalan number. 



3. RNA STRUCTURES 



In this section we derive the generating function for RNA structures. The successful strategy 
consists in counting the "wrong" object "multiple" times. To be precise we will enumerate all 
fc-noncrossing digraphs with j 1-arcs by relating them to a family whose cardinality we can easily 
compute. We denote the number of RNA structures with exactly £ isolated vertices by Sk(n, £). 
Suppose k > 2 and let S„ k {f-ij) be the set of all fc-noncrossing digraphs having exactly £ isolated 
points and exactly j 1-arcs, where a Tare is an arc of the form (i, i + 1), 1 < i < n — 1. Setting 
&k(n,£,j) = |Sn,fe(^>i)l) w e have in particular S k (n,£) = G k (n,£,0). 

Theorem 3. Let k £ N, k > 2, let C m denote the m-th Catalan number and f k {n, £) be the number 
of k-noncrossing digraphs over n vertices with exactly £ isolated vertices. Then the number of RNA 
structures with £ isolated vertices, S k (n,£), is given by 

(n-l)/2 / _h\ 

(3.1) S k (n,£)= £ C-^Tfe )A( n - 2 M). 

b=0 ^ ' 

where f k (n — 26, £) is given by the generating function in eq. (|2.12p . Furthermore the number of 
k-noncrossing RNA structures, S k (n) is 

L«/2J , _,s (n-2b \ 

(3.2) S*(n)= £(-l) 6 P ^/»(n-2i,/) 

h=o ^ ' I e=o ) 

where {^1=^ /fc( n ~ 26, £)} is given by the generating function in eq. (|2.13| . 
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Proof. We first prove 

(3.3) Yl (l) G *M,i) = ( H ~ b ) fk(n - 2b, £) 

j>b ^ ' ^ ' 



For this purpose we construct a family 5" of S„ ^-digraphs, having exactly £ isolated points and 
having at least b 1-arcs as follows: select (a) b 1-arcs, and (b) an arbitrary fc-noncrossing digraph 
with exactly £ isolated points over the remaining n — 2b vertices. Let "5 be the resulting family of 
digraphs. 

Claim 1. Each element 9 6 J is contained in S n ,k(£,j) for some j > b. 

To prove this we observe that a 1-arc cannot cross any other arc, i.e. cannot be contained in a 
set of mutually crossing arcs. As a result for fc > 2 our construction generates digraphs that 
are fc-noncrossing. Clearly 9 has exactly £ isolated vertices and in step (b) wc potentially derive 
additional 1-arcs, whence j > b. 
Claim 2. 

(3.4) |J| = ( n , l) \}k{n-2b,£) . 



b 

Let \(n, b) denote the number of ways to select b 1-arcs over {1, . . . , n}. We observe that X(n, b) = 
( n ^ b )- Identifying the two incident vertices of an 1-arc we conclude that we can choose the b 
1-arcs in (™^ & ) ways. Obviously, £ isolated vertices can be obtained in (™~J, 2fc ) different ways and 
it remains to select an arbitrary fc-noncrossing digraph with exactly £ isolated points over n — 2b 
vertices. The number of those is given by f(n — 2b, £) which we can compute via Lemma[21 whence 
eq. (|3.4|) and Claim 2 is proved. 

In view of the fact that any of the fc-noncrossing digraphs can introduce additional 1-arcs we set 

= {9 G £F 9 has exactly j 1-arcs} . 

Obviously, 7 = Uj^b^C?)- Suppose 9 G ~f{j)- According to Claim 1, 9 € 9n,k(£, j) and furthermore 
9 occurs with multiplicity in ? since by construction any 6-element subset of the j 1-arcs is 
counted respectively in 5F. Therefore we have 

(3-5) \?(j)\ = (i)Gk(n,£,j) 



and 



( 3 M k (n,£,j) = J2 I = ( U b b )h(n - 2b, £) 



j>b ^ ' j>b 
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whence eq. p.3[) . We next set Fk(x) = J2j>o Gk(n,£, j) xK Taking the 6-th derivative and let 
x = 1 we obtain 

(3-6) (l) = WiW Mj j)l'*-*. 



6! fe w ^ U 

Claim 2 provides an interpretation of the r.h.s. of eq. (|3.6 

— b 



(3.7) g (jjGfcM, j) P- 6 = (" 6 jA(n - 26,*) 



In order to connect Fk(x) and A.FW(1) we consider the Taylor expansion of Fk(x) at x = 1 and 
compute 

F fc (a ! ) = ^iFW(l)(x-l) b = 2 U(n-2M)(*-l) b . 

6>0 ' 6=0 ^ ' 

In view of Sk(n,£) = Gk(n,£,Q) is the constant term of Fk(x), i.e. Ffe(O), whence 

(n-f)/2 / _ JA 

(3.8) S fc (M) = E (- 1 )" \ )/fc(n-2M). 

&=o ^ ' 

It remains to prove eq (|3 . 2|) . Summing over all possible values of isolated vertices, we get 

n (n-*)/2 , A L»/2J / , N (n-2b } 

&(»)=E E (-in^ )h(n-2b,t)=J2(-l) b ( n - b ) \j2Mn-2b,£)\ 
where Y^eZo b fk ( n ~ 26, €) is given by eq. (|2.13|) and the proof of the theorem is complete. □ 



n 


12 3 
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10 


11 


12 


13 


14 


15 


S 3 (n) 


1 1 2 


5 


13 


36 


105 


321 


1018 


3334 


11216 


38635 


135835 


486337 


1769500 



Table 1. The first 15 numbers of 3-noncrossing RNA structures. 



A first implication of Theorem [3] is a new proof for Waterman's formula [27j for the number of 
RNA secondary structures with exactly k base pairs or cquivalcntly having I = n — 2k isolated 
vertices. 
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Corollary 1. The number of RNA secondary structures having exactly £ isolated vertices, S 2 (n, £), 
is given by 



Furthermore S 2 (n, satisfies the recursion 

(3.10) (n-£)(n-^ + 2) -S 2 (n,£) - (n + f)(n + £ - 2) • S 2 (n - 2, = 



Proof. We actually give two independent proofs of eq (|3 . 9|) : the first being a direct computation 
based on eq. (|3.1j) and the second using the recursion in eq. (|3.10| derived by Zeilbcrger's algorithm 
[20] . Let = k we compute 



(n-l)/2 



S 3 (n,*) = E (- 1 ) 6 



fc=0 
k 



n — b\ ( n — 2b 
b [ £ 



1 



f-^ ' W(n-2fc)! (ft-6 + l)!(*-6)! 

(n-fc-1)! * (n-6)! fc!_ 

(n - 2k)\ -k\f^ ' (k-b+ l)!(n - fc - 1)! 6!(fc - &)! 



1 { n — k\ v^U , _ „ t /fc\ / n — 6 



y ' n-k\ k J f-^\bj\k-b+l 

1 (n — k\ (n — k 
n-k\k+lj\ k 
1 fn — k\ (n — k — 1 



k\k + l)\ k-l 



As for the second proof we use S 2 (n,.£) = SfcL(/^ 2 (— l) 6 ( n 6 b )f^i n ~ 2b, £) as the input for Zeil- 
berger's algorithm [SO] and obtain that S2(n,£) satisfies the recursion formula 



(3.11) (n - £)(n - £ + 2) ■ S 2 (n, £) - (n + £){n + £ - 2) • S 2 (n - 2,£) = 
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Using a bijection between RNA secondary structures and linear trees Waterman computed in [57] 
the number of RNA secondary structures with exactly h arcs, s(n, h) 

\(n- h\ fn-h-1 

It follows by direct computation that S2(n, n — 2h) = s(n, h) = ( n ^_7 ) satisfies the recur- 

sion in cq. (|3.11[) . from which we can conclude S2(n, £) = s(n, (n — £)/2). □ 

Corollary 2. The number of 3-noncrossing RNA structures having exactly I isolated vertices, 
S 3 (n, £), is given by 

(n-t)/2 



(3.13) s 3 (M)= £ (-i- 



b=0 



r ' s ' f ''2 

2 2 ~r z 9 H 1 



Using the expression of Corollary [5] for S 3 (n, £) as an input for Zcilberger's algorithm [5D] we derive 

Corollary 3. The number of 3-noncrossing RNA structures having exactly £ isolated vertices, 
S 3 (n,£), satisfies the A-term recursion 

(3.14) Pl (n) S 3 (n - 6, £) - p a (n) S 3 (n - i,£) - p 3 (n)S 3 (n - 2, f) + p 4 (n) S 3 (n, *) = , 
where the coefficients p\(n,£), p2(n,£) p 3 (n, £) and pa{ti,£) are given by 
Pi(n,£) = -n(n-l)(n-10 + £)(n-4: + £)(n-8 + £) 

p 2 (n, £) = ^n(n - 3)(13n 3 - 126n 2 + 13n 2 £ - 88n£ + 392n + 3n£ 2 + 216£ - 384 - 42£ 2 + 3^ 3 ) 
p 3 (n, £) = (n - l)(-n - 2)(13n 3 - 30n 2 - 13n 2 £ + 8n+ 16n£ + 3n£ 2 + 30£ 2 - 72£ - 3£ 3 ) 
p 4 (n, £) = (n- 3)(-n - 2)(n - £)(n - £ + 6)(n - £ + 4) . 



Theorem [3] immediately allows us to derive the generating function for circular fc-noncrossing 
RNA structures. Circular RNA structures are fc-noncrossing digraphs without arcs of the form 
(l,n), representing molecular structures over circular sequences. In circular sequences the arc 
(n, 1) is considered a bond of the primary sequence and consequently does not occur as an arc 
in the corresponding digraph representation. Suppose fc > 2 and let 9n\(^j) be the set of all 
fc-noncrossing digraphs having exactly £ isolated points and exactly j 1-arcs, where a 1-arc is an 
arc of the form (i,i + 1), where i is considered modulo n. We set G^\n,£, j) = \3^\{£,j)\- 
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Theorem 4. Let k G N, k > 2, then the number of circular k-noncrossing RNA structures, with 
exactly £ isolated vertices S^\n,£), is given by 

(n-H)/2 



(3.15) S[ c \n,£)= J2 (-!)' 



6=0 



(n-2)-(6-l) 
6-1 



71 — 6 

6 



f k (n-2bj) 



where X {c) (n, 6)(1, 0) = 0, A (c) (n, 6)(1, 1) = 1, A (c) (2,0) = and A (c) (2,2) = 1. 



L«/2J 

(3.16) s£ c) (n) = E^ 1 ) 6 

6=0 



(n - 2) - (b - 1)\ t (n - 6 
6-1 



'n-26 



E fk(n-2b,£) 



where fk{ n ~ 26, £) is given by eg. (|2 . 13[) . 



Proof. For circular RNA structures the 1-arcs are considered modulo n. Again wc derive a family 
3" of Sn^-digraphs, having exactly I isolated points and at least 6 1-arcs. We select (a) 6 1-arcs, 
and (b) an arbitrary fc-noncrossing digraph with exactly I isolated points over the remaining n — 26 
vertices. In complete analogy we derive that each element 6 £ 5F is contained in < 3if\{^, j) for some 
j > 6. Let A' ' (n, 6), denote the number of ways to select 6 1-arcs over {1, . . . , n} including the arc 
(n, 1). Then A' c )(n, 6) is given by 



(3.17) 



A (c) (n,6) 



(n-2)-(6-l) 
6-1 



n — b 
b 



where (n,b) (1,0) = 0, X^ (n, 6)(1, 1) = 1, A( c )(2,0) = and A< c )(2,2) = 1. Indeed, either 
the arc (n, 1) is selected in which case we are left with exactly (^™ _2 t~P ) ways to select the 
remaining 1-arcs or (n, 1) is not selected, in which case according to Theorem [3] there are exactly 
(™6 b ) wa Y s to select the 1-arcs. Therefore we obtain 



(3.18) 



\ - 3 



^ \b 

j>b 



n,£,j) 



(»-2)-(6-l) 
6-1 



n — 6 
6 



h(n-2b,£) 



In complete analogy to the argument in Theorem [3] we can conclude 



(n-e)/2 



(3.19) 



si c W)= E 



6=0 



(n - 2) - (6 - 1) 
6-1 



n — b 
b 



fk(n-2bj) 



Eq. (|3.16[) follows analogously and the proof of the theorem is complete. 



□ 
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4. Restricted RNA structures 



We now generalize the ideas in Section [3] for the enumeration of restricted RNA structures. A 
restricted RNA structure is an RNA structure without any 2-arcs, i.e. arcs of the form (i,i + 2). In 
this case we need the condition k > 2 instead of k > 2, since our construction can produce 2-sets 
of mutually crossing arcs. Let Sn,k(£, ji, j 2 ) be the set of all fc-noncrossing digraphs having exactly 
£ isolated points and exactly ji and j 2 1-and 2-arcs. We set Gk(n,£, ji, j 2 ) = |Sn,fc(^i ji> h)\- I n 
particular we have Gk(n,£,0, 0) = sL r (n,£). 

Theorem 5. Let k G N, k > 2. Then the numbers of restricted RNA structures S k r (n,£) and 
S k r (n) are given by 

(4.1) S[ r) (n,£) = Yl (-l) bl+b2 Hn,b ll b 2 )f k (n~2(b 1 +b 2 ) 1 £) 

bl>0,b 2 >0 

|ra/2j (n-2(b 1 +b 2 ) "| 

(4.2) Si r \n) = (-l) bl+b2 HnM,b 2 )\ £ f k (n - 2(6i + b 2 ),£) \ . 

bi>0,b 2 >0 [ 1=0 J 

Here X(n,b±,b 2 ) satisfies the recursion 

(4.3) A(n, 61, 6 2 ) = A(n - 2, 61 - 1, 6 2 ) + A(n - 1, 61, 6 2 ) + A(n - 4, 61, 62 - 2) + A(n - 3, h,b 2 - 1) 

and £/ie initial conditions for eq. (|4.3[) are A(n, 0,0) = 1, A(n, 61, 0) = ( n ~ b bl ) , A(n, 0, b 2 ) =7(71, b 2 ) 
and j(n, 1) = for n = 1, j(n, 1) = n — 2 /or n > 2 and 7(71, 2) = /or n = 2, 3. 



Proof. Suppose A(n, b\,b 2 ) is the number of ways to select exactly 61 1-arcs and b 2 2-arcs over 
{1, . . . ,n} vertices. 

Claim. A(n, b\, b 2 ) satisfies the recursion of cq. (|4.3[) with the respective initial conditions, and we 
have 

(4.4) (l 1 ) ({ 2 ) G k(n,£,h,j2) =A(n, 61,62) f k(n - 2(h + b 2 ),£) . 

3i>6i,J 3 >6a V 1/ V 2/ 

In analogy to the proof of Theorem[3]we derive a family 3" of Sn^-digraphs, having exactly £ isolated 
points and at least b\ and b 2 1-arcs and 2-arcs, respectively. We first prove that this construction 
generates elements of 9n,k(£, 31,32) an d then express \J\ via the numbers Gk(n, £, ji, j 2 ). We 
select (a) 61 1-arcs and b 2 2-arcs and (b) an arbitrary fc-noncrossing digraph over the remaining 
n — 2(6i + b 2 ) vertices with exactly £ isolated points. Let 3 be the family of digraphs obtained this 
way. 
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Claim 1. Each element 9 6 5" is contained in S n ,k(£, ji, 32) for some ji > 61 and j'2 > ^2- 

To prove this we observe that any 1-arc or 2-arc can only cross at most one other arc. Therefore 

Tares and 2-arcs cannot be contained in a set of more than 2-mutually crossing arcs. As a result, 

for k > 2 we generate digraphs that are fc-noncrossing. Clearly 8 has exactly I isolated vertices 

and in step (b) we potentially derive additional 1-arcs and 2-arcs, whence ji > b\ and 32 > 62, 

respectively. 

Claim 2. 

(4.5) I J| = A(n, 61, 62) fk(n - 2(bi + 62), i) ■ 

We prove that the number of ways to select 1 and 2-arcs satisfies the recursion in eq (|4.3[) by 
induction on n. For the induction step we distinguish the following cases: 

Case 1. The arc (1, 2) is selected. Then we have A(n — 2, 61 — 1, 62) ways to select (61 — 1) 1-arcs 
and 62 2-arcs over the vertices {3, . . . , n}. 

Case 2. The arc (1,2) not selected. Then we distinguish the scenarios: (1,3) is selected and 
(1,3) is not selected. In the latter case we have X(n — 1,61,62) ways to choose 61 1-arcs and 62 
2-arcs over the vertices {2, . . . ,n}. Suppose (1,3) is selected. Then we have either that (2,4) is 
selected, in which case we can select the remaining 61 1-arcs and 62 2-arcs over {5, . . . , n} in exactly 
A(n — 4, 61, 6 2 — 2) different ways. In case (2, 4) is not selected we can freely choose 61 1-arcs and 
(6 2 — 1) 2-arcs over {4, . . . ,n} i.e. there are A(n — 3,61,62 — 1) ways. Therefore we derive the 
recursion 

A(n, 61, 6 2 ) = A(n - 2, 61 - 1, 6 2 ) + A(n - 1, 61, 6 2 ) + A(n - 4, 61, 6 2 - 2) + A(n - 3, 61, 6 2 - 1) . 

As for the intial conditions, we have are A(n, 0,0) = 1, A(n, 61, 0) = ( n ^ 1 ), A(n, 0, 6 2 ) = 7(71,62) 
and 7(h, 1) = for n = 1, 7(h, 1) = n — 2 for n > 2 and 7(71, 2) = for n = 2,3. It remains 
to select an arbitrary fc-noncrossing digraph with £ isolated vertices over n — 2(6i + 62) vertices. 
According to Lemma [2] the latter number is given by fk{n ~ 2(6i + 62),^), whence eq. (|4.5[1 and 
Claim 2 is proved. In view of the fact that any of the fc-noncrossing digraphs over n — 2(6i + 62) 
vertices can introduce additional 1-arcs or 2-arcs, we set 

9^.71 1. 72) = {0 € y I 6 has exactly ji 1-arcs and J2 2-arcs} . 

Obviously, we have the partition J = U^^, j2 > b2 ^{h,k)- Suppose G ^{31,32)- According 
to Claim 1, 6 € 9n,k(P, ji, 32) and furthermore 8 occurs with multiplicity (^) (^) in "3 since by 
construction any 61-element subset of the j\ 1-arcs and 62-element subset of the 32 2-arcs is counted 
respectively in 3\ Therefore we have 

(4.6) \Hh,h)\ = fr) ^G k (n,£, J1 , j2 ) 



18 



EMMA Y. JIN, JING QIN AND CHRISTIAN M. REIDYS * 



and 



E 

jl>bl, J2>&2 



3i\/32 
h) \b 2 



jl>bi,j2>b 2 

= X(n,b 1 ,b 2 )f k (n-2(b 1 +b 2 ),£) . 



We next set F)-(x,y) — 5Zj 1>0 J2j 2 >o ^k(n, £, ji, j2)x^ x y^ 3 . Taking the &i-th and 6 2 -th derivatives 
w.r.t. x and y we obtain 



(4.7) 
Then we have 



1 1 p (biM) (u 



Y G k(n,l > ji > j2)x jl y j2 

Jljj2>0 



E 

ji>bi> J2>b2 



E 

&i>0,b 2 >0 



6i7 U 



E 



1 — 6l 1 i2-&2 



Jl \ /J2 



Gk(n,e,ji,j 2 ) 



{x-l)^{y-l) h 



Jb\ I \b 2 

3i>bi,32>b2 

Y A(n, 6i,6 2 ) / fc (n - 2^ + 6 2 ), £) (a: - l) bl (y - 1) 

6i>0,b 2 >0 



By construction G(n,£, 0,0) is the constant term of the Fk(x,y). That is, the number of k- 
noncrossing RNA structures with £ isolated vertices and no 2-arcs is given by 



G(n,£,0,0)= Yl (-l) bl+b2 Kn,b u b 2 )f k (n-2(b 1 + b 2 ),£) 

6i>0,6 2 >0 



(4.8) 



and taking the sum over all £ eq. (|4.2|) follows 

[n/2j ( n- 2 (b 1 +b 2 ) 

Si r) H= E (-l) 6l+b2 A(n,6!,6 2 )i Y Mn-2(b 1+ b 2 ),£) 

bi>0,& 2 >0 [ f=0 

where |X)™=o 2 ' fcl+b2 ' /*-'( n — 2(&i + b 2 ),£) j is given by eq. (|2.13[) and the proof of the theorem is 
complete. □ 



n 


12 3 4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


S r 3 (n) 


1112 


5 


14 


40 


119 


364 


1145 


3688 


12139 


40734 


139071 


482214 



Table 3. The first 15 numbers of 3-noncrossing restricted RNA structures. 
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5. Appendix 

Proof of Theorem [TJ Suppose we have two shapes // C /j 1-1 and Tj_i is a standard Young 
tableau of shape [i 1 . We first observe that there exists a unique j and a unique Tj such that 
Ti_i is obtained from T by row-inserting j with the RSK algorithm. 

Suppose fi 1 differs from // in the first row. Then j is the element at the end of the first row in 
Ti—i. Otherwise suppose I is the row of the square being removed from Tj_i. Remove the square 
and insert its element x into the (£— l)-th row at precisely the position, where the removed element 
y would push it down via the RSK-algorithm. That is y is maximal subject to y < x. Since each 
column is strictly increasing y always exists. Iterating this process results in exactly one element 
j being removed from T and a new filling of Hi-i, i.e. a unique tableau T,_i. By construction, 
inserting j with the RSK algorithm produces XJ_i. 

Claim 1. There exists a bijection between the set of oscillating tableaux of length n and digraphs 
with vertices of degree < 1 . 

Given an oscillating tableau (yu l )™ =0 (/•«* differs from yti 4 1 by at most one square), we recursively 
define a sequence (Go, To), (Gi, T), . . . , (G n ,T n ), where Gi is a digraph and T is a standard Young 
tableau. We define Go to be the digraph with empty edge-set and To to be the empty standard 
Young tableau. The tableau T is obtained from T_i and the digraph Gi is obtained from Gj_x 
by the following procedure: 

1. (Insert origins) For [i l D then T is obtained from T_i by adding the entry i in the square 

2. (Isolated vertices) For p? = /i 1 ^ 1 then set T = T%-\ 

3. (Remove origins) For fj, % C then let T be the unique standard Young tableau of shape // 
and j be the unique number such that Tj_i is obtained from T by row-inserting j with the RSK 
algorithm. Then set E Gi = E Gi _ 1 U {(j, i)}. 

Obviously, G„ is a digraph, and the set of i where // = /i 4 " 1 equals the set of isolated vertices of 
G„. By construction each entry j is removed exactly once whence no edges of the form and 
(j, i') can be obtained. Therefore G n has degree < 1 and we have a well defined mapping 

'■ {(Mi)£=o I (^i)i=o is an oscillating tableau} — > {G„ | G„ is a digraph with degree < 1} . 

It is clear from the procedure that G n is a labeled graph and (3 is injective. To prove surjectiv- 
ity we observe that each digraph G„ induces an oscillating tableau as follows. We set /i Gn = 
and T n = 0. Starting from vertex i = n, n — 1, . . . , 1, we derive a sequence of Young tableaux 
(T„, T„_x, . . . , T ) as follows: 

I. If i is an terminus of an G„-arc add j via the RSK-algorithm to T set 3 fi % Gn to be 
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the shape of T^_i (corresponds to (3)) 

II. If i is an isolated G„-vertex set //q 1 = /j,q (corresponds to (2)) 

III. If i is the origin of an G„-arc (i, k) let /i 1 ^ 1 C fj, l G be the shape of T,-_i, the standard Young 
tableau obtained by removing the square containing i (corresponds to (1)). 

Then we have P((fJ-G n )o) = whence (3 is surjective. 

Claim 2. G n is fc-noncrossing if and only if all shapes \i % in the oscillating tableau have less than 
k rows. 

From Claim 1 we know (3~ 1 (G n ) = (0 = fx , /i 1 , . . . fi n = 0), so it suffices to prove that the 
maximal number of rows in the shape set /3 _1 (G„) is less than k. First we observe that the arcs 
Ji), ■ ■ ■ {ii,je) form a ^-crossing of G n if and only if there exists a tableau T such that elements 
ii, 12, ■ ■ ■ H are in the t squares of T and being deleted in increasing order %\ < %2 < ■ ■ ■ %t after- 
wards. Next, we will obtain a permutation 7Tj from the entries in each tableau Tj recursively as 
follows: 

1. If Tj_i is obtained from Ti by row-inserting j with the RSK algorithm, then 7Ti_i = mj. 

2. If Ti = T t -i, then Hi = 7Tj_i. 

3. If Tj_i is obtained from Ti by deleting the entry i, then iii-i is obtained from Ki by deleting i. 
If 7T = T\T2 ■ ■ ■ Tt, then the entries being deleted afterwards are in the order rt, ■ • .T2,Ti. 

Using the RSK algorithm w.r.t. the permutation ni, the resulting row- inserting Young tableau is 
exactly T. We prove this by induction in reverse order of the oscillating tableau. It is trivial for 
the case i = n. Suppose it holds for j, 1 < j < n. Consider the above three cases: inserting 
an element, doing nothing and deleting an clement. In the first case, the assertion is implied the 
RSK algorithm in the construction of the oscillating tableau. In the second case, it holds by the 
induction hypothesis on step j. 

Now it remains to consider the third case, that is, removing the entry from Tj to get Tj—±. Write 
iTj = x\X2 ■ ■ ■ Xpjyiy2 ■ ■ - Vq and 7Tj_i = x\X2 ■ ■ ■ x v y\])2 ■ ■ ■ y q - In view of step 3 j is larger than 
elements x\, X2, • ■ ■ , x p , yi, . . . y q . We need to prove that the insertion tableau Sj—i of 7Tj_i by the 
RSK algorithm is exactly the same as deleting the entry j in Tj. We proceed by induction on q. 
In the case q = 0, Tj is obtained from Tj-\ by adding j at the end of the first row. Suppose the 
assertion holds for q-l, that is Sj-\(xix 2 ■ ■■x v y 1 y 2 ■ ■ ■ y q -\) = Sj(xix 2 ■ ■ - x v jy Y y2 ■ ■ ■y g -i)\[~j~|- 
Consider inserting y q into Sj-i, via the RSK algorithm. If the insertion track path never touches 
the position of j, then Sj-i(x!X 2 ■ ■ ■ x p yiy 2 ■ ■ ■ y q -iy q ) = S j (x 1 x 2 ■ ■ ■ x p jyiy 2 ■ ■ ■ y q -iy q ) \ 
Otherwise, if the insertion path touched j and pushed j into the next row, then since j is 
greater than any other entry, j must be moved to the end of next row and the push process 
stops. Accordingly, the insertion path in Sj-±(xiX2 ■ ■ ■ x v y\y2 ■ ■ ■ Vq—i) is the same path as in 
Sj{x\X2 ■ ■ ■ x p jyiy2 ■ ■ ■ Uq-i) except the last step moving j to a new position j, so deleting j will 
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get Sj-i(xiX2-..x p y 1 y2...y q -iyq) = S j {x 1 x 2 ■ ■ ■ x p jyiy 2 ■ ■ ■ y q ~iy q ) \ According to Schen- 
stcd's Theorem, for any permutation 7r, assume A is the corresponding insertion Young tableau by 
using the RSK algorithm on it. Then the length of the longest decreasing subsequences of ir is the 
number of rows in A, whence the assertion. 

Now we can prove Claim 2. A diagraph is a ^-crossing if and only if there exists a iti which has 
decreasing subsequence of length t. And the insertion Young tableau of 7Tj is exactly the same with 
the labeled oscillating tableau X^. According to Schensted's theorem, 7r has a decreasing sequence 
of length £ if and only if rows of Ti is £. 

Claim 3. There is a bijection between oscillating tableaux with at most k — 1 rows of length n and 
walks with steps ie^, which stay in the interior of Co starting and ending at (k — 1, k — 2, . . . , 1). 
This bijection is obtained by setting for 1 < i < k — 1, xi to be the length of the ^-th row. By 
definition of standard Young tableaux, we have Ai > A2 > . . . A„ i.e. the length of each row is 
weakly decreasing. This property also characterizes walks that stay within the Weyl-chamber Co, 
i.e. where we have x± > X2 ■ • • > Xk-i > since a walk from (fc — 1, ... 2, 1) to itself in the interior 
of Co is a translation of a walk from the origin to itself in the region x\ > x 2 . . . > Xk—i > 0. In 
an oscillating tableau differs from fx 1 by at most one square and adding or deleting a square 
in the £-th row or doing nothing corresponds to steps ±ei and 0, respectively. Since the oscillating 
tableau is of empty shape, we have walks from the origin to itself, whence Claim 3 follows and the 
proof of the Theorem is complete. □ 

Proof of Lemma [TJ To prove the lemma we can w.l.o.g. assume C = Co = {{x\, . . . ,Xk-i) \ 
x\ > x 2 > ■ ■ ■ > Xk—i > 0}. Then the assertion is that every walk having steps ±ei,0 starting at 
a = (k — 1, k — 2, . . . , 1) that crosses from inside Co into outside Co intersects one of the sub-spaces 
(ei) or (ej — ej-i) for 2 < j < k — 1. This is correct since to leave C implies that there exists some 
i such that xi < afj+i. Let Sj be minimal w.r.t. a + s h ^ Co- Since we have steps ±ej,0 we 

conclude xu~i = or Xj = Xj~i for some 2 < j < k — 1, whence the lemma. □ 

Proof of Theorem [2l Totally order the roots of A. Let T~(a, b) be the number of walks 7 from 
a to b, a, b G Z fe_1 of length ?? using the steps s, s G {±ei,0} such that (7(s r ), a) ~ for some 
a G A (i.e. the walk intersects with the subspace (a)). According to Lemma Q] every walk that 
crosses from inside C into outside C touches a wall from which we can draw two conclusions: 



(5.1) 
(5.2) 



T n (a,b) = r+(a,6) +T n (a,b) 
/3^id r„(/3(a),6) = r- (/3(a), 6). 
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Claim. E /3e B fc _ 1 (-l)" (/3) r„(/3(a),6) = 0. 

Let (si, . . . , s n ) be a walk from /3(a) to b. By assumption there exists some step s r at which we have 
(l0(a),b( s r), ot) = 0, for a G A. Let a* be the largest root for which we have {pjM a ) 1 b{ s r)i°^) = 



and (3 a * (x) = x 



2{a*,x) 
(a* ,q* ) 



a* its associated reflection (eq. (|2.3[0 . We consider the walk 



(5.3) 



(/3a* (si), . . . , /3 a . (s r ), s 



r+1) • 



) 5 n) 



Now by definition (/3 Q » (si), . . . , (3 a * (s r ), s r +i, . . . , s„) starts at (/?„» °/3)(a) and has sign (— 
since £(/3) + 1 = £({3 a * ° /3)- Therefore to each element 7^( tt ),6 of r~(/?(a), 6) having sign (— 
there exits a J/3 a „0r a ),b £ L~(/3 a »/3(a), 6) with sign (— 1) £ (^) +1 and the claim follows. We immedi- 
ately derive 



£ (-!)'<« r„ 03(a), 6) 



^ =r-(/3(a),&) 

r+(a,6)+r-(a,6)+ £ (-1)'C0 r"(/3(a), 6) , 

/3eB fc _i,/3^icl 



(-!)*(« r-(/3(o),6)=0 



whence the theorem. □ 



Proof of Lemma [2} Let itj, 1 < i < k — 1 be transcendent variables and u = (ui)i ■ We define 



,b-a 



Y\i=\ u i z a * ■ Let F(x,u) be a generating function, then F(x,u)\ u b- a equals the family of 



coefficients a,i(u) at u b a of J2i>o a i{ u ) x% ■ We first observe 



T n (a,b) 
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The exponential generating function for T n (a, b) is 



n>0 



n>0 



fc-1 



i=l 



E 

Tl>0 



i=l 

2 II e*P( x ( u i + u i X )) 



We furthermore derive 



n >o 



fc-i 



E r "( a ^)^-= ex e (-i) i(/3) n ex p(^+^ 1 )) 

/3eB fc -i i=l 



and writing /3 = e/, <r we obtain 



E r "( a ' 6 )^T = e * E E^ sgn ^ II (expC^^+ur 1 )) 

ra>0 creSfc-i fc=l i=l V 

fc-1 fc-1 / 
= e :E ^ sgn(cT-) ^ ?7fe JJ ( exp(x(u i + u" 1 )) 

crGSfe_i /i=l i=l V 

= e x s S n ( a ) { II ( e Mx(u t + u^ 1 )) 

CTeS fe _i U=i V 

fc-i / 
J exp(x(uj + u" 1 )) 



e x detk—ixk— i[Iai— bj (2x) — lai+bj (2a 



I fc-i 



where 77/j = ±1 and the lemma follows. □ 
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